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Abstract 



(N; 

\^ . In many problems, the inputs to the problem arrive over time. As each input is received, it must be 

p^ I dealt with irrevocably. Such problems are online problems. An increasingly common method of solving 

j^ . online problems is to solve the corresponding linear program, obtained either directly for the problem 

O ' or by relaxing the integrality constraints. If required, the fractional solution obtained is then rounded 

online to obtain an integral solution. 

We give algorithms for solving linear programs with mixed packing and covering constraints online. 
We first consider mixed packing and covering linear programs, where packing constraints Px < p are 
I— i' given offline and covering constraints Cx > c are received online. The objective is to minimize the 

C/^ . maximum multiplicative factor by which any packing constraint is violated, while satisfying the cover- 

ing constraints. For general mixed packing and covering linear programs, no prior sublinear competitive 
lyj . algorithms are known. We give the first such — a polylogarithmic-competitive algorithm for solving 

O ' mixed packing and covering linear programs online. We also show a nearly tight lower bound. 

Our techniques for the upper bound use an exponential penalty function in conjunction with mul- 
^\i ' tiplicative updates. While exponential penalty functions are used previously to solve linear programs 

^ . offline approximately, offline algorithms know the constraints beforehand and can optimize greedily. In 

•/^ ' contrast, when constraints arrive online, updates need to be more complex. 

^^ . We apply our techniques to solve two online fixed-charge problems with congestion. These prob- 

v(»^ ' lems are motivated by applications in machine scheduling and facility location. The linear program for 

• . these problems is more complicated than mixed packing and covering, and presents unique challenges. 

^L} ' We show that our techniques combined with a randomized rounding procedure can be used to obtain 

*vj i polylogarithmic-competitive integral solutions. These problems generalize online set-cover, for which 

^-H ' there is a polylogarithmic lower bound. Hence, our results are close to tight. 
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1 Introduction 

In this paper, we give the first onUne algorithm for general mixed packing and covering linear programs 
(LPs) with a sublinear competitive ratio. The problem we study is as follows. 

Online Mixed Packing and Covering (OMPC). Given: a set oi packing constraints Px < p, and a set of 

covering constraints Cx > c, with positive coefficients and variables x, such that the packing constraints 
are known in advance, and the covering constraints arrive one at a time. Goal: After the aixival of each 
covering constraint, increase x so that the new covering constraint is satisfied and the amount A by which 
we must multiply p to make the packing constraints feasible is as small as possible. 

Mixed packing and covering problems model a wide range of problems in combinatorial optimization 
and operations research. These problems include facility location, machine scheduling, and circuit routing. 
In these problems, requests for resources such as bandwidth or processing time are received over time, or 
online, whereas the set of resources is known offline. As each request arrives, we must allocate resources 
to satisfy the request. These allocations are often impossible or costly to revoke. The resources correspond 
to packing constraints in our setting and are known offline. Requests correspond to covering constraints, 
and arrive online. The performance of an online algorithm is measured by the competitive ratio, defined as 
the worst case ratio of the value of the solution obtained by the online algorithm to the value obtained by 
the optimal offline algorithm which has as its input the entire sequence of requests. The worst case ratio is 
over all possible sequences of inputs. 

Many techniques to solve integer problems onUne first obtain a fractional solution, and then round this 
to an integer solution |[Tl|2j|5j|6l. The first step involves solving a linear program relaxation of the original 
problem online. In fact, this can be a bottleneck step in obtaining a good competitive ratio. Thus, our 
algorithm for online mixed packing and covering can provide an important first step in obtaining good 
online solutions to several combinatorial problems. We demonstrate the power of our ideas by extending 
them to give the first online algorithms with sublinear competitive ratios for a number of fixed-charge 
problems with capacity constraints. For these problems, we first solve the linear program relaxation online, 
and then use known randomized rounding techniques to obtain an integer solution online. 

Applications. We use our techniques to study two problems with fixed-charge and congestion costs. Both 
fixed-charge problems and congestion problems are widely studied offline and online; we discuss specific 
applications and references below. In general, fixed charges are used to model one-time costs such as 
resource purchases or installation costs, while congestion captures the load on any resource. In machine 
scheduling, for example, the makespan can be modelled as the maximum congestion by setting each re- 
source to be a machine and setting unit capacity for each machine. 

Application 1: Unrelated Machine Scheduling with Start-up Costs (UMSC). Given offline: a set of 
machines {1, . . . , m} with start-up cost Cj for machine i. Jobs arrive online, and job j requires pij time 
to be processed on machine i. Goal: when a job j arrives, determine whether to "open" new machines 
by paying their stait-up cost, and then assign the job to one of the open machines, so that the sum of the 
makespan of the schedule — the maximimum over machines of the processing times of the jobs assigned 
to it — and the sum of start-up costs is minimized. 

The problem of scheduling jobs to minimize the makespan and the fixed charges is studied both of- 
fline |[r7ll26l and online |[T4ll25ll24ll . The problem is motivated by reducing energy consumption in large 
data centers, such as those used by Google and Amazon 1111261. The energy consumption of a large data 
center is estimated to exceed that of thousands of homes, and the energy costs of these centers is in the tens 
of millions of dollars ll32l . hence algorithms that focus on reducing energy consumption are of practical 
importance. The inclusion of a fixed charge models the cost of starting up a machine. Thus machines do 
not need to stay on, and can be started when the load increases. Bicriteria results for the offline problem are 
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given in |[26l and ifTTl using different techniques. We show strong lower bounds for bicriteria results in the 
online setting, and therefore focus on algorithms for the sum objective. For the online problem with iden- 
tical machines, |[T4l [25l give constant-competitive algorithms for the sum objective. These are extended to 
the case where machines have speed either 1 or s, with more general costs for the machines in Il24l . 



Application 2: Capacity Constrained Facility Location (CCFL). Given offline: a set of facilities T with 
fixed-charge Cj and capacity Uj for each facility i in T . Clients arrive online, and each client j has an 
assignment cost aij and a demand "Pij on being assigned to facility i. Goal: when client j arrives, determine 
whether to open new facilities by paying their fixed charge, and then assign the client to an open facility, so 
that the sum of the maximum congestion of any facility, total assignment costs, and the total fixed charges 
paid for opened facilities is minimized. The congestion of a facility is the ratio of the sum of the loads of 
clients assigned to the facility to the capacity of the facility. 

For online facility location without capacity constraints, a ( ^^"^"^ j -competitive ratio is possible 

when the assignment costs form a metric |[T8l . and a 0(log7Tilogn)-competitive ratio is possible when 
assignment costs are non-metric |[l], with n cUents and m facilities. Capacitated facility location is a natural 
extension to the problem. In the offline setting, constant-factor approximation algorithms are known for 
both facility location with soft capacities — when multiple facilities can be opened at a location — and hard 
capacities — when either a single facility or no facility is opened at each location |[30l[35l . Our problem is 
a variant of non-metric soft-capacitated facility location where instead of minimizing the cost of installing 
multiple facilities at a location, we minimize the load on the single facility at each location, in addition to 
fixed-charge and assignment costs. 

Our Results. We give polylogarithmic competitive ratios for the problems discussed. Our results ai^e the 
first sublinear guarantees for these problems. 

• ForOMPC: 

- A deterministic 0(lnmln(dpK))-competitive algorithm, where m is the number of packing 
constraints, d is the maximum number of variables in any constraint, p is the ratio of the max- 
imum to the minimum non-zero packing coefficient and k is the ratio of the maximum to the 
minimum non-zero covering coefficient (Section |2]i. If all coefficients are either or 1, this 
gives a O (In mind) -competitive algorithm. 

- A lower bound of 0(lnmln(d/lnm)) for any deterministic algorithm for OMPC. Our algo- 
rithm for OMPC is thus nearly tight (Section IZ3]|. 

• For CCFL and UMSC: 

- A randomized 0{\n.(mnp) ln^(mn))-competitive algorithm for CCFL, where m and n ai^e the 
number of facilities and clients respectively, and p is is the ratio of the maximum to the min- 
imum total cost of assigning a single client (Section O. We obtain the same competitive ratio 
for UMSC, where m and n are the number of machines and jobs respectively, and p is the ratio 
of the maximum to the minimum total cost of assigning a single job. 

- A lower bound for bicriteria results for CCFL: even if the maximum congestion T is given 
offline, no deterministic online algorithm can obtain a fractional solution with maximum con- 
gestion o{jn)T and fixed-charge within a polylogarithmic factor of the optimal (Section [33] ). 
This lower bound also holds for UMSC , where T is the makespan. 

Since each of our applications include fixed-charges as part of the objective, they generalize online set 
cover. In UMSC, for example, set cover is obtained by setting the processing times to be either zero or 



infinity. Since tlie makespan in any bounded solution to tlie problem is now zero, this reduces the problem 
to covering jobs with machines to minimize the sum of machine startup costs. Online set cover has a lower 
bound of [^(logmlogn) on the competitive ratio assuming BPP ^ NP |[3l. Thus, our results for UMSC 
and CCFL are tight modulo a logarithmic factor. 

Our Techniques. Our techniques for online mixed packing and covering are based on a novel extension 
of multiplicative weight updates. We replace the packing constraints in our problem with an exponential 
penalty function that gives an upper bound on the violation of any constraint. When a covering constraint 
arrives, the increment to any variable is inversely proportional to the rate of change of this penalty function 
with respect to the variable. We use a primal-dual analysis to show that this technique, combined with a 
doubling approach used in previous online algorithms, gives the required competitive ratio. While expo- 
nential potential functions are widely used for offline algorithms and machine learning, e.g. |@], our work 
is the first to use an exponential potential function to drive multiplicative updates that yield provably good 
competitive ratios for online algorithms. 

Our work is closely related to work on solving pure packing packing and pure covering linear programs 
online, and Lagrangean-relaxation techniques for solving linear programs approximately offline. Multi- 
plicative weight updates are used in lITOl to obtain 0(logn)-competitive fractional solutions for covering 
lineal- programs when the constraints arrive online. In lITOl , the cost is a simple linear function of the vari- 
ables. The update to each variable is inversely proportional to the sensitivity of the cost function relative to 
the variable, given by the variable's coefficient in the cost function. In our problem, however, the cost is the 
maximum violation of any packing constraint. The cost function is thus nonlinear, and since its sensitivity 
relative to a variable changes, it is not apparent how to extend the techniques from lITOl . We use an expo- 
nential potential function to obtain a differentiable approximation to this nonlinear cost. For each variable, 
our updates depend on the sensitivity of this potential function relative to the variable. In addition to the 
primal-dual techniques in lITOl . a key step in our analysis is to obtain bounds on the rate of change of this 
potential function. 

A large body of work uses Lagrangean-relaxation techniques to obtain approximate algorithms for 
solving LPs offline, e.g., |[3Tl[34l . In these papers, the constraints in the LP are replaced by an exponential 
penalty function. In each update, the update vector for the variables minimizes the change in the penalty 
function. In this sense, the updates in these offline algorithms are greedy. Since the constraints are available 
offline, this gives e-approximate solutions. In our case, since covering constraints amve online, greedy 
algorithms perform very poorly, and we must use different techniques. We use an exponential penalty 
function similar to offline algorithms. However, our updates are very different. Instead of a greedy strategy 
as used in lOTllS?! . we hedge our bets and increment all variables that appear in the covering constraint. 
The increment to each variable is inversely proportional to its contribution to the penalty function. 

For fixed-charge problems with capacity constraints, we solve the corresponding linear programs on- 
line and, for our applications, round the fractional solutions to obtain integral solutions online. The linear 
programs for these problems are significantly more complicated than mixed packing and covering. We 
combine our techniques for mixed packing and covering with a more complex doubling approach to ob- 
tain fractional solutions, and adapt randomized rounding procedures used previously offline for machine 
scheduling Il26l and online for set cover lITTI to obtain integral solutions. 

Other Related Work. Multiplicative updates are used in a wide variety of contexts. They are used in both 
offline approximation algorithms for packing and covering problems (71 [16] [19] [20] |2T] [22] [27] [28] [29] |3l] 
[33] O, and online approximations for pure packing or pure covering problems based on linear programs 
such as set cover lfT2l . caching |J6T|, paging 15], and ad allocations Q. Both offline and online, these 
algorithms are analyzed using a primal-dual framework. Multiplicative updates are used earlier |[T] |2] to 
implicitly solve a linear program online for various network optimization problems. The fractional solution 



obtained was rounded online to obtain an integral solution. Multiplicative weight updates also have a long 
history in learning theory; these results are surveyed in [4]. 

Our work studies the worst-case behaviour of our algorithms assuming adversarial inputs. A large body 
of work studies algorithms for online problems when the inputs are received as the result of a stochastic 
process. Two common models studied in the literature are (1) when the inputs aie picked from a distribution 
(either known or unknown), and (2) when an adversary picks the inputs, but the inputs are presented to the 
algorithm in random order. The adwords and display ads problems can be modeled as packing linear 
programs with variables arriving online. A number of papers give algorithms for these problems assuming 
stochastic inputs; some of these results are presented in llT3l[T5l . 

2 Online Mixed Packing and Covering 

In this section, we consider mixed packing and covering linear programs. A mixed packing and covering 
lineal- program has two types of constraints: covering constraints of the fonii Cx > c, and packing con- 
straints of the form Px < p. We normalize the constraints so that the right side of each constraint is 1. Our 
objective is to obtain a solution x that minimizes the maximum amount by which any packing constraint is 
violated. Thus, our problem is to obtain a solution to the following linear program: 

minAs.t. Cx> 1, Px< A, X, A > 0. (1) 

The packing constraints are given to us initially, and the covering constraints are revealed one at a time. 
Our online algorithm assigns fractional values to the variables. As covering constraints arrive, the variable 
values can be increased, but cannot be decreased. 

For a vector v, we use both Vi and {v)i to denote its ith component. We use [n] to denote the set {1, 2, 
. . . , n}. The vector of all ones and all zeros is denoted by 1 and 0, respectively. 

The number of variables, number of packing constraints, and number of covering constraints in the 
linear program are denoted by n, m, and rric respectively. We use d to denote the maximum number of vari- 
ables in any constraint. We define p := max^ j- pkj/ mmk, j-.p,, ■>o Pkj and similarly k = maxj j Cij/ miiij j^Ci >o Qj- 
The value of k is used only in the analysis of the algorithm; we do not need to know its value during exe- 
cution. Define ki := maxj cij, i.e., ki is the maximum coefficient in the first covering constraint to arrive. 
Define di as the maximum number of variables in any packing constraint, and the first covering constraint. 
Define p := 1 + jj^r^^, and a := e^ h\{p,d^phi). Here, e is the base of the natural logarithm. 

We use OPT to denote the optimal value of A given P and C, hence OPT is the value returned by the 
optimal offline algorithm. 

In order to analyze our algorithm, we consider the dual of dD as well: 



Y, y^ s.t. C^y < P^z, X] Zfc < 1, y, z > (2) 



max _ _ 

i k=l 



2.1 An Algorithm for Mixed Packing and Covering Online 

We now give an algorithm for solving OMPC and show that it is 0{log{dpK) log m)-competitive. We 
assume in the following discussion that we are given a scaling parameter F > maxk. j Pkj / {di pKi) , which 
is used to scale the matrix of packing coefficients P. In Theorem [TTl we show that if 20PT > -^ > OPT 
then our algorithm yields the stated competitive ratio. Without this estimate F, we can use a "doubling 
procedure" commonly used in online algorithms, which increases the competitive ratio obtained by a factor 
of 4 (Section [ 



Given a vector x, let A(x) := maxfcg[f„](Px)fc. For a given scaling pai^ameter F, let P := P/F, 
Pkj ■= Pkj/r and A(x) := maxfcg[„,](Px)fc. Let est(x) := In \y2ke[m] exp(Px)fcj be an estimate of 
A(x), and note that maxfc(Px)fc < est(x) < maxfc(Px)fc + Inni. For each variable Xj, define 

^^i EfeGM exp(Px)fc 

Our algorithm is given as Algorithm[T] Upon receiving the first constraint, we initialize Xj ^ l/{d\pKi) 
for all j G [n] . We also initialize a counter variable / -^ 0. 

When a covering constraint (Cx)j > 1 arrives it gets assigned a new dual variable yi, and the variables 
are incremented as described. The dual variables y are used only in the analysis. 

For covering constraint i, define 

ej(x) := (^ - 1) mill ratej(x)/cij , (4) 

j:Cij>Q 

SO that for all j G [n], ei(x)cjj/ratej(x) < // — 1. In line[8l each variable Xj gets increased by at most a 
factor of p, and at least one variable gets incremented by a factor of /i. 

Algorithm 1 MPC-APPROX: Upon arrival of ith covering constraint: 



When first constraint arrives, initialize Xj ^ l/(df p^i) for all j G [n], and / ^ 0. 
Upon anival of zth covering constraint: 
while (Cx)j < 1 do 

/ ^ / + 1, x' ^ a; 

Vj, ratej ^ ratej( 

Ei ^ ei(x') 

for j G [n] do 

1 + e, 



x') 


/* defined in © */ 




/* defined in © */ 


c., \ 




ratej / 





9: Vi ^ yi + eej /* for analysis */ 

10: if A(x) > 31n(em) then return FAIL 



A single iteration of the while loop is a phase, indexed by I, and the first phase is phase 0. The value 
of the variables before they are incremented in phase Hs x'. x*^ denotes the values after initialization. 
For covering constraint i, Li is the indices of the phases executed from its arrival until (Cx)j > 1, and 
L := UiU 

We first show upper bounds on values attained by the variables, and on the running time. 

Lemma 1. During the execution of the algorithm, for any j G [n], Xj < fi/ niinj;cj >o Qj- 

Proof. For any Xj, if minj^c. >o CijXj > 1, then Xj will not be incremented further in any phase since any 
covering constraint i with qj > must already be satisfied. Thus, since the value of any variable increases 
by at most a factor of /_f in a phase, Xj < /.t/ minj:cj >o Qj- D 

Lemma 2. MPC-Approx executes 0{nln{fi(PpK,) Inm) phases, and each phase takes time 0{mn). 

Proof. In each phase, the value of at least one variable gets incremented by a factor of p. Each variable has 
an initial value of l/{d\pK,i). Let Uj be the number of phases in which Xj gets increased by a factor of p. 
Thenxj > p"-^ /{djpKi). Since by Lemma [B Xj < p/ mmi^c^j^oCij, Uj < log^(/xdfpft;i/minj:ci^.>o Qj). 
Observing that for all j, ki / uimi-a >o Cij < k and di < d, it follows that each variable can be increased 



by /i in at most logJ^cPpn) phases. Since in each phase at least one variable increases by a factor of fi, the 
number of phases is at most nlog {fid? pn) = nln{fid? pn) / In p. Since ln(l + x) > x/e for < 2; < 1, 
lii/U > l/(3eln(e?7i)). Thus the number of phases is at most 0{nlog{fj.cfpK) logm). In each phase, Px 
can be computed in 0{mn) time; then est(x) and each ratej(x) can be computed in time 0{m). Thus 
each phase takes time 0{mn). D 

Our proof of the competitive ratio follows from a primal-dual analysis. We show in Corollary |6]that 
In 771 + OPT jr plus the value of the dual objective maintained by the algorithm is an upper bound on the 
primal objective maintained by the algorithm. Lemmas |7] and [8] show how the dual variables maintained 
by the algorithm can be scaled down to obtain feasible dual values. We show in Theorem [TT] that together 
these prove the bound on the competitive ratio. 

We first show that the initialization of the variables ensures that est(x'^) does not exceed ^^^ + Inm. 

Lemma 3. For the variables as initialized, A(x'^) < OPT/F, and hence est(x'^) < ^^ — h Inm. 

Proof. Let x* be the values for the variables in an optimal solution. After the first covering constraint is 
received, 1 < X^, ^ijX* < max,, cir "^j x*. Since the first covering constraint has at most di variables, 
there exists variable xl > l/{di maxj. cir), and hence 

OPT = max(Px*)fc > min Pkjxl > min pkj / {di max cir) = inin pkj/{diKi). 

ke[m] k,j:pkj>0 k,j:pkj>0 r k,j:pkj>0 

Using p = maxkj Pkj /^i'n.k,j:pkj>o Pkj, 

OPT > max Pkj/{dipKi) = F max Pkj/{dipni) ■ (5) 

Our algorithm initializes x^ = l/{d\pK,i), and hence 

EJ 
A(x ) = niax(Px )j^ < diniaxpkj/{FdipKi) < viiayipkj/{FdipK,i) < OPT/F, (6) 

ke[m] k,j k,j 

where the first inequality is because any packing constraint has at most di variables. Thus, est(x'^) < 
A(x'^) + ln?7i < (OPT/F) + In 771, proving the lemma. D 

Corollary 4. If F > uiaK-kj Pkj / (dipni), then A(x ) < 31n(e?7T,) at the beginning of any phase I. 

Proof. Since F > m.axf^jPf^j/{dipKi), by ^, A(x'^) < 1. Thus the lemma is satisfied for the first phase. 
For any phase I > 0, the algorithm would have failed at the end of phase / — 1 if A(x) > 31n(em). Since 
the algorithm did not fail in phase / — 1, in any phase /, A(x') < 3 lii{em). D 

Lemma 5. If F > maxkj Pkj/idipni), the increase in the dual objective ^^ yi is an upper bound on the 
increase in est(x) in every phase. 

Proof. Let est' and est'+^ denote the values of est(x) before and after the variables ai^e incremented in 
phase I, respectively. We will show that est'"*"^ — est' < ee^, which is the increase in Ylit Vi in phase I. 

Let x' and x'+^ be the values of x before and after being incremented in phase I. For each Xj, let 
g-[t) := x^- + (a;^+^ - x^-)t for < t < 1. Note that ^^(O) = ^ and ^^(l) = x^^ . Define g(t) = 
{gi{t) , g2{t) , . . . ,gm{t)). With some abuse of notation, any function of x, say /i(x), can be viewed as a 
function of t, with h{t) := h{g{t)). Thus, the functions est(x) and ratej(x) can be written as functions of 
t: est(t) = lnJ2kelm] exp(Pg(t))fc, and 



rate,(t) = rate,(g(t)) = ^-^ = — ^-^^^^^ . (7) 

We use these alternate expressions in the remainder of the proof. By the chain rule, 

dest(i) s-^ dest{t) dgj(t) 

dt ~ ^ dgj{t) dt ' 

and hence, 

est'+^-est' = /' ^dt =y t rate,(t)%^dt. (8) 

In a phase, each variable is incremented by at most a factor of /i. Therefore x'+^ < /^tx'. Then since 
r > maxfcj pfcj/((iipKi), by Corollary |4] A(x') < 31n(e7n) in any phase /. Thus ratej(t) < eratej(O) for 
< t < 1 by Lemma |45] (in Appendix). Hence 

est'+^-est' < e^ratej(x') / -^j^dt = e ^ratej(x')(x5+^ - x^) . 

Since in phase I each variable Xj gets multiplied by 1 + e.i(x') , /^'j ,. , 

est'+^ - est^ < eei(x') y^rateo(x') — ^^ / ,^ = eei(x') y^ Cjox'- < eei(x') 

-^^ rate,(xM -^^ -^ 

i=i -^ i=i 

where the last inequality follows since, on entering the for loop, (Cx)j < 1. Since eej(x') is the increase 
in the dual objective, this proves the lemma. D 

Corollary 6. If F > mayikj Pkj / (dipKi), then J2i Hi ^ est(x) — ^ InTTi. 

Proof. By Lemma [5j the increase in X^^yj is an upper bound on the increase in est(x), thus YliVi — 
est(x) — est(x°). By Lemma[3l est(x°) < Inm + ^^j^, and hence J2i Vi ^ est(x) — Ihttt, — ^^^. D 

We now show that the dual variables do not violate the dual constraints by much. We choose the dual 
variable z^ con^esponding to each packing constraint k E [m] as 

exp((PxOfc) 

Zk := max , , ;.. (9) 

i&L exp(est(x'jj 



Lemma 7. For z as defined in ^, > z^ < ln(e7n) + max A(x 



feefml 






Proof. For each packing constraint k, let (l){k) := argmax^ '^^'^Z ^ id ■ Thus z^ attains its value in phase 
(j){k). We index the packing constraints so that 0(1) < (/>(2) < • • • < (j){m). Then for any r, A; G [m] with 
k > r so that (f){k) > (j){r), we have (Px''^^*''))^ > (Px''^^''))^ since the variables x are increasing. Thus, 

exp(est(x'^('=))) = ^ exp((Px'^W),) > ^exp((Px<^W),) > ^exp((Px<^W),) . (10) 

rG[m] r<k r<k 

Substituting ([TOll into I© yields 



exp((Px^W)fc) ^ exp((Px'^('=))fc) 



exp(est(x'^('=))) Er<fcexp(Px<^W)^ ' 
Then by Lemma l44l in the appendix, with a^ = exp((Px'^(*^^))fc), 

E^-^^ ^ l + l-(^ e,p(p,^(i))^ j ^ l + lnm + maxA(x), (11) 

where the last inequality follows since exp((Px''^('^'')i) > 1 and (Px')fc < A(x') by definition of A. D 

The next lemma tells us how much we must scale the dual solution obtained by the algorithm to obtain 
a dual feasible solution. 

Lemma 8. For any j G [n], (C'^y)j < (P'^z)j — . 

Proof. Consider a phase I executed upon arrival of a covering constraint i. In this phase, yi gets incremented 
by eej(x'). This increment occurs in every phase in Lj. Hence 

(C'^y)j = ^ Cijyi = e ^ Cij^ei(x'). (12) 

By Lemma[T] Xj < /u/minj:c, >o Cij. Further, since the initial value of Xj is l/{dlpKi) and is multiplied 



by (l + Ci j^ j in every phase, for all j G [n] , 

where the last inequality is since ejCjj/ratej < l/(31n(e?7i)) < 1 and for < a < 1, e"'^ < 1 + a. 
Multiplying on both sides by dfpKi, taking the natural log, and reversing the inequality, 

and multiplying both sides by e • max^g^ ratej(x'), 

y N^ ej(x )cjj < emaxratej(x ) In (/ud^yOK) . (13) 

Thus from (IT2l ) and (IT3T l. (C'^y)j < e^ max^g^ ratej(x') In (^fidlpn) < a raaxi^L ratej(x'). We will 
now show that max/g/, ratej(x') < (P^z)j, completing the proof. This follows since 

, , ;, EfcGMPfciexp((Px')fc) ^ ^ exp((Px')fe) ^ . 

max rate, x = max —-r— < > p^imax , ,,, = > PkiZu ■ 



i€L -^ i€L exp(est(xM) ^^ -^ leL exp(est(xO) 



D 



We now use the previous lemmas to prove the bound on the competitive ratio of our algorithm. 

Lemma 9. If F > 2aOPT, then MPC-APPROX does not fail. 

Proof. Let x-'^ and (y-^, z-^) be the values for the primal and dual variables when at line[TO]in the algorithm. 
X'^ may be infeasible for the primal since the current job may not yet be assigned, however, (y^, z^) are 
feasible for the dual. Let x* and (y*, z*) be the optimal solution. Then 

OPT = A(x*) = Y. yl (14) 

ig[7Tlc] 

where the last equality follows from LP strong duality. For convenience of notation, \t\.v := In(em) + 
A(x'^). Since x is non-decreasing, A(x'^) = max; A(x'). Then by Lemmas |7] and [8j z^ /u and y-^ • r/{au) 
are feasible values for the dual variables. Thus the optimal dual value ^^ y* is at least as lai^ge as ^^ yj • 
r/{(jv). From ©, OPT > uiaxkjPkj/dipKi. Hence if T > OPT, the condition for Corollary |6] is 
satisfied. From (IT4l ) and Corollary |6] 

OPT 

OPT > ^ y/ • r/{aiy) > (est(x-^) - Inm — ) • r/{au) , 

i 

or, rearranging terms, 

^OPT^ — + lnm > est(x-^). 

Substituting the value of u, and since est(x''^) > A(x'^), 

OPT / \ OPT 

a ( In(em) + A(x^) ) + —-- + In m > A(x-^) . (15) 



Using the bound on OPT from the statement of the lemma, and since cj > 1, 

In(em) + A(x-^) T/ a 

^ '— — i — ^+ln(em) > A(x^), 

and simplifying yields A(x^) < 3 ln(e?n). Hence, if i^ > 2aOPT, the algorithm does not fail. D 

Lemma 10. If AaOPT > P > msiXkjPkj/idipKi) and MPC-AppROX does not fail, it returns a 
8a ln{e7n) -competitive solution. 

Proof. Since P > nia:x.kjPkj/{dipKi), from (IT5] ). 

OPra('ln(em) + A(x^)) +OPT + rinm > rA(x^) = A(x-'') . 

Using the upper bound on P, and since A(x''^) < 31n(em) by CoroUaiyHJ 

40PTa In(em) + OPT + 40PTa Inm > A(xO • 

This proves the lemma. D 

Since OPT > mayikj Pkj/dipKi, Lemmas l9l and [TOl impl v 

Theorem 11. IfAaOPT > P > 2aOPT, then MPC-AppROX does not fail and returns a 8aln{em)- 
competitive solution. 



2.2 Proceeding Without an Estimate on OPT. 

We now discuss how to proceed without an estimate on OPT. We use a doubhng procedure commonly used 
in onhne algorithms. We initially set F = uiax^j-.p^. >oPkj/{dipni) and use this value to scale the packing 
constraints. We mn Algorithm MPC-APPROX with the scaled values. If the algorithm fails, we double F, 
scale the packing constraints by the new value of F and run the algorithm again. We repeat this each time 
the algorithm fails. 

Each execution of Algorithm MPC-APPROX is a trial. Each trial r has distinct primal and dual vari- 
ables (A(t), x(t)) and (y(T), z(t)) that ai^e initialized at the start of the trial and increase as the trial 
proceeds. At the start of the trial, each Xj{t) is initialized to x^At) = l/{d\pni). If a trial fails, we double 
the value of F and proceed with the next trial with new primal and dual variables. Thus in every trial, 
F > ma-Kkj.p^^yoPkj/{dipKi). 

Our final value for (x, A) is the sum of the values obtained in each trial. Thus, our variables are non- 
decreasing. Let F{t) be the value of F used in trial r, and \-^{t) be the value of the primal A(r) when 
trial T ends. T is the last trial, i.e., the algorithm does not fail in trial T. Since x obtained by the algorithm 
is the sum of x(r) in each trial r, the value of the primal objective obtained by the algorithm is at most 
E.<TA^(T).Then 

Theorem 12. The value of the primal objective X]^<y A-* (r) obtained is 0{lnmln{dpK,))OPT. 

We first show a bound on F in any trial. 

Lemma 13. In any trial, F < AaOPT. 

Proof. Initially, F = max^ j-p^, >oPkj/{dipKi) < OPT by dSjl. Hence the lemma is true for the first trial. 
Since F is doubled after each failed trial, by Lemma |9]some trial with F < AaOPF will not fail. Hence, 
for every trial, F < AaOPF. D 

Proof of Theorem\T2\ Define A^(r) := A-^(r)/r(T). By Corollary g] A(t) < 31n(em) at the start of any 
phase. Within a phase, each variable gets multiplied by at most a factor of /i = 1 + 1/(3 In (em)). Hence 
when trial r fails, A-^(r) = \f{T)/F{T) < 1 + 31n(em) < 41n(em), or A^(r) < 4r(r)ln(em). Since 
the value of F{t) doubles after each trial, 

^A-^(r) < 41n(em)^r(T) = 41n(em) ^ 2^-^r(r) < 81n(e?n)r(r) . (16) 

T<T T<T T<T 

Thus, from (IT6l) and Lemma [T3l "^Z^^x ^'^i'^) — ^^o" ln{em)OPF, proving the theorem. D 

2.3 A Lower Bound for Mixed Packing and Covering Online 

We give a lower bound on the competitive ratio of any deterministic algorithm for online mixed packing 
and covering. Given upper bounds m and d on the number of packing constraints and on the number of 
variables in any (packing or covering) constraint respectively, we give an example to show the following 
lower bound. 

Theorem 14. Any deterministic algorithm for OMPC is Q.(\og{d/ log m) log m)-competitive. 

Our algorithm for OMPC in Section |2T|is thus nearly tight. For parameters d and m, we give an exam- 
ple which has m packing constraints, at most 2d variables in each covering constraint, and at most d log m 
variables in each packing constraint. For this example, we show that OPT = 1 and any deterministic 
algorithm gets value Q.{\ogd\og m). The theorem follows. 
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We assume that both d and m are powers of 2 without loss of generality, otherwise we redefine d to 
be the highest power of 2 that is at most the given value of d, and redefine m similaiiy. Our example has 
2{m — l)d variables. We partition the variables into 2(m — 1) pairwise disjoint sets, with each set consisting 
of d variables, and use Bi to refer to the ith set. We refer to these sets as blocks. For any set of variables S, 
we use w{S) to refer to the sum of the values assigned by the algorithm to the variables in S, and use S(5) 
to refer to the expression "^^^g x. 

We first show how given two blocks Bi and B2 of size d, we can construct covering constraints so that 
w{Bi) > Hii/2 for one of i G {1, 2}, while the constraints can be satisfied by setting Xj = 1 for a single 
variable Xj € -Bj', i' / i. The covering constraints ai^e given by Algorithmic Hd refers to the dth hai^monic 
number. 

Algorithm 2 Given blocks Bi and B2 : 



B^ -^ Bi, B2 -^ B2 
for i = l^d— Ido 

Offer the covering constraint 'E{B[ U B'2) > 1 

Let xi, X2 be the variables assigned maximum value in B[ and B2 respectively 

B[^B[\ {xi}, B'2^B'2\ {X2} 
Offer the covering constraint Tj{B[ U B2) > 1 



Lemma 15. Either w{Bi) > Hd/2orw{B2) > Hd/2. 

Proof. Let xi, X2 be the variables assigned maximum value in B[ and B'2 respectively in the ith iteration 
of the for loop. Since \B[\ = |i?2| = {d + 1 — i), and xi, X2 have maximum value in B[ and B2 
respectively, xi > w{B[)/{d + 1 — i) and X2 > w{B2)/{d + 1 — i). Further, since w{B[) + w{B2) > 1, 
xi + X2 > l/(d + 1 — i). Thus when all the covering constraints are satisfied, w{Bi U B2) > Hd, and 
hence either w{Bi) > Hd/2 or w{B2) > Hd/2. D 

Assume w{Bi) > Hd/2. Then there exists some variables Xj € B2 which is in each covering constraint 
introduced, and hence all the constraints can be satisfied by setting this variable to 1. 

For the complete example, consider a complete binary tree with m, leaf nodes. Each node in this tree 
except the root corresponds to a block, and no two nodes correspond to the same block. Our packing 
constraints con^espond to the leaf nodes, with packing constraint k being E{\Ji(zQ^,Bi) < A where Qk is the 
set of blocks encountered on the path from the root to the leaf node con^esponding to packing consti'aint k. 

For a node v, let / and r be the left and right child respectively, and let Bi and Br be the blocks 
corresponding to these children. We now start from the root node and walk to a leaf node in the following 
way. When we are at node v, we run Algorithmic with blocks Bi and Br. lfw{Bi) > w{Br) we step on the 
left child and "mark" the right child, else we step on the right child and "mark" the left child. We continue 
with the node we stepped on as node v, and continue in this manner until we reach a leaf node. Then say 
the leaf node we arrive at corresponds to packing constraint k. Since each block on the path from the root 
to this leaf node (except the root) has weight at least Hd/2 by Lemma [T5l and the path from the root to 
any leaf has log n nodes, the total value assigned by the algorithm to variables in this constraint is at least 
log m ■ Hd/2, thus A > log m ■ Hd/2. 

On the other hand, setting a single variable to 1 in each marked node satisfies all the covering con- 
straints. The path from the root to any leaf node contains at most one marked node, since when we mark 
a node, the blocks in the subtree rooted at that node do not appear in any covering constraint. Thus, we 
can satisfy the covering constraints by setting at most a single variable to 1 in each packing constraint, 
where for a packing constraint, the variable set is in the marked node (if any) in the path from the root to 
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the leaf corresponding to the packing constraint. Hence OPT = 1 , and any deterministic algorithm obtains 

A > log m ■ Hd/2. 

3 UMSC and CCFL 

We now build upon the techniques in Section |2] and give a polylogarithmic-competitive integral algorithm 
for UMSC and CCFL. Recall that both UMSC and CCFL generalize online set-cover, for which there is 
a lower bound of 0(log?TT,logn) on the competitive ratio assuming BPP / NP ^. Our algorithm is thus 
tight modulo a logarithmic factor. 

CCFL generalizes UMSC ; an instance of UMSC is an instance of CCFL where each facility corresponds 
to a machine and has unit capacity, each client corresponds to a job, and all assignment costs are zero. We 
describe an algorithm for CCFL, which also gives an algorithm for UMSC with the same competitive ratio. 
Further, in CCFL, the demand pij and capacity Uj only appear as the ratio Pij/ui. We redefine pij as this 
ratio, and assume that the capacity of every facility is unity. The congestion of a facility is then the sum of 
the demands of clients assigned to the facility. 

We use [m] to denote the set of facilities, and [n] to denote the set of clients. We exclude trivial 
instances and assume m, n are at least 2. We assume that n is given offline. Variables i, i' index facilities, 
while j, j' index clients. Clients appear in order of their indices: the first client is client 1, and the last client 
is client n. The total cost Z of an assignment of clients to facilities is the sum of the maximum congestion, 
fixed-charge, and assignment costs. Z* is the total cost of the optimal assignment. We will assume we are 
given an estimate Z with Z* < Z < 2Z*. In the absence of this estimate, we use a doubling approach as 
described previously; Section l34] explains how doubling can be used for this particulai^ problem. For client 
j, Fj{Z) := {i : pij + Ojj + Cj < Z}. Since assigning client j to a facility i not in Fj{Z) would incur total 
cost larger than Z, we fix the fractional assigment of client j to any facility i not in Fj{Z) to be zero. 

We first give an algorithm that obtains a fractional solution for the problem, and then use a randomized 
rounding technique adapted from [i26il to obtain an integral solution. A fractional solution corresponds to 
a solution to linear program CCFL-LPl(Z), which takes Z as a parameter A client may be fractionally 
assigned to facilities, and the sum of these fractional assignments for each client must be at least 1. xij is 
the fractional assignment of client j to facility i. Facilities can also be opened fractionally, and yi is the 
fraction to which facility i is open. The fraction to which any facility is opened is an upper bound on both 
the fractional assignment of any client to that facility, and the ratio of congestion of the facility to Z. A is 
an upper bound on yi for each facility. Since for every facility the fraction yi is an upper bound on the ratio 
of congestion to Z, Z\ is the maximum congestion of any facility. 



CCFL-LPl(Z): 


min S Ciyi + Z\ + 


/ , O-ij^ij 




iairn] 


^e[1 


-n],iG[n] 




Z/igH ^ij 


> 1, 


Vj G [n] 




Hi Xij 


> 0, 


Vi e [m],j G [n 




Zyi - Y.j(zln]PijXij 


> 0, 


\/i G [m] 




^-Vi 


> 0, 


\/i G [m] 




A 


> 1 





X is the vector {xij)i^[ra],j&[nY ^^'^ similarly y is the vector (yOieH^ C'PTi(.Z') is the cost of the opti- 
mal solution to CCFL-LPl(Z). Since we restrict assignment of client j to facilities in Fj {Z), if Fj{Z) = 0, 
CCFL-LPl(Z) is infeasible. We assume that Z* < Z < 2Z*, and hence CCFL-LPl(Z) is feasible. Also, 
since A > 1, 
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Fact 16. OVTi{Z) > Z. 

Define p := maxj mtn'(c'+^'''+a'o • ^^ ^^^ techniques from Section|2]and obtain an 0(log(7Tin) log{mnp))- 
competitive fractional solution for CCFL-LPl(Z). We begin by highlighting the major differences between 
CCFL and OMPC, and briefly mention how they are dealt with. 

Firstly, the linear- programm CCFL-LPl(Z) no longer consists solely of packing and covering con- 
straints, since there are variables with negative coefficients. However, we can still express the objective 
of CCFL-LPl(Z) as a function of the vector x: given x that satisfies the first set of constraints in CCFL- 
LPl(Z'), define yi(x) as the minimum value of y^ that satisfies the remaining constraints; A(x) is defined 
con^espondingly. We then proceed as in MPC-APPROX: we define cost(x) as a derivable approximation 
to the objective, and rate(x) as the derivative of cost(x). rate(x) is then used to obtain the multiplicative 
updates for variables x. 

Secondly, whereas earlier rate(x) was a continuous function of x for OMPC, this is no longer the 
case for CCFL. Now rate(x) depends on whether the second set of constraints in CCFL-LPl(Z) are tight. 
We deal with this by separating the updates where the second set of constraints is tight, and increment x 
differently in each case. 

Thirdly, in Section |2] for each variable Xj since the packing constraints are available offline, the coef- 
ficients of Xj are also known offline. This allows us to initialize variables offline. In CCFL, clients anive 
online, and when each client anives, we learn its demand and assignment cost for each facility. Thus the 
coefficients dij and Cy of each variable x-ij in CCFL-LPl(Z) are received online, and these variables need 
to be initialized online. We show that the increase in cost(x) because of these initializations is small. 

Fourthly, CCFL-LPl(Z) is a parametric LP. If we do not know Z*, we use a doubling procedure twice: 
once to obtain Z such that Z* < Z < 2Z*, and once again to scale CCFL-LPl(Z) by F, as in Section|2l 



3.1 A Fractional Solution for CCFL 

We start by scahng the CCFL-LPl(Z) by a parameter T to obtain LP2{Z, F) and its dual, D2(Z, T). 
OVT2{Z, F) is the cost of the optimal solution to LP2(Z, A). In the following analysis we will keep the 
dual variable p = and exclude it from further discussion. 



LP2{Z, F): mill ^ Ciiji + ZA + — y^ a 



E 



«j 



Xij > 1, Vj 



Vi-^ > 0, Vi,j 

Zyi -^Pij-fT > 0, Vi 

A - yi > 0, Vz 

A > 1 



D2(Z, F): max y, (^j + 7;/^ 



Fuj - Pij - pij^ii 



F' 

<0, Vi,j 



y^ Ai + Z-ii -5i < Ci, Vi 
j 

^<5i + /x <Z 



Fact 17. For any Z, F > 0, (x, y, A) is a feasible solution to CCFL-LPl(Z) of cost Z' iff (x, y/^T, A/^T) 
is a feasible solution to LP2(Z, F) of cost Z' jF. 

Given a vector x, let Wi(y^ := maxjXjj/r, i)i(x) := ^■pijXij/{ZF). Define yi(x) := {'j(x) + 

Wi{yi), and A(x) := maxiyi(x). If x satisfies "^^li^ij — ^ f^i' ^11 i' "^^en (x, y(x), A(x)) is feasible for 
LP2(Z, F), where y is the m-vector of values (yi(x))jgr„i. As in Section[2l 
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est(x) := InV^ (e^J "^zr^ ) + In ( y^ e J-" 



(17) 



«j 



is an upper bound on A(x), and cost(x) := Z ■ est(x) + ^^ Ciyi{:x.) + J2i j o-ijXij/r is an upper bound on 
the cost of the solution (x, y(x), A(x)). The rate of change of cost(x) is 



rate.,(x) := ^^^ = Z^^ + ^ f ^ + 1,,U ^ 



(18) 



where ly = 1 if Xij = Fwi, and otherwise. 
Our algorithm is given as ASSIGN(I^) . Define 



X, 



- . (19) 

for all i, j. When chent j arrives, 



._ 1 miiit/ {cji + pjij + aji 
^^ 2mn Ci + pij + Uij 

At the beginning of the algorithm, before any requests arrive, set Xij - 
we initialize Xij = x^ for all i. As long as j is not fully assigned, we increment the fractional assignment 
Xij for each client i. The increment occurs in phases where a phase is a single iteration of the while loop 
in the algorithm. The phases are indexed by I. L is the set of all phase indices, and Lj is the set of phase 
indices for phases executed to assign client j. 

We 



The increase in Xij in each phase is inversely proportional to ratejj(x). Define /^i := 1 + 
also scale each update for client j by ej(x), where 



1 



6 In(emn) 



ej(x) := (fi — 1) minratejj(x) 



(20) 



This definition ensures that in each phase, the factor by which each variable is incremented is at most fi. 



Algorithm 3 AssiGN(r) : When client j arrives 



while ^- Xij < 1 do 

/ ^ / + 1, x' ^ X 
fori eFj{Z) do 

if Wi(x') > Xij /r then 

Xij ■(— min < rwi{'x!-),x. 
else 



«j 



1 + 



^.,(x') 
ratCij(x') 



Uj •<— Uj + eej (x' ) 

if cost(x) > 5Z In(emn) then return FAIL 



/* see definitions in ([T8]l, ^ *l 



/* for analysis */ 



Fact 18. At any stage in the algorithm, Xij < ^for all i, j. 

Our analysis of ASSIGN(I^) follows the primal-dual analysis in Section |2] closely. One difference be- 
tween the cunent problem and Section |2] is that since we receive requests online, we do not know the 
coefficients of variables in the packing constraints offline, and unlike Section |2] cannot initialize our vari- 
ables offline. In ASSIGN(7^) as each request is received, we obtain the corresponding coefficients, and 
initialize our variables Xij = x^ in line[T] For client j, define initj as the change in cost(x) on execution of 
line[T]when j aixives. cost(x) is initially cost(O) and increases either due to line[T]or within a phase. We 
begin our analysis by showing bounds on the change in cost(x) due to these. 
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Lemma 19. If F > 1, then initj < Z/nfor any client j. 

Proof. Fix a client j. Let x' and x" be the values of x before and after the execution of line [T] on anival of 
client j. We consider the differences Z(est(x") — est(x')) and ^^ Ci{vi{x.") + Wi{x.")) — ^^ cj({;j(x') + 
Wi{x.')) +J2i,j'0'ij'i^ij' - ^'ij')/^ separately. Since cost(x) = ^iCi{vi{x) + mj(x)) + Zest(x) + 
T Si lijXij, the sum of these differences will give us initj. 
By definition of est(x), 



est(x") — est(x') = InN^exp 



z2j'Pij'^ii' 



zr 



in y^ exp 



z2j'Pij'^ij' 

zr 



+ lnE 



exp 



ij' 



^ij' 



1-E 



exp 



^ij' 



«J 



x" and x' differ only in values for client j, and aie identical for other requests j' ^ j. Further, for all i 

and / > j, x'lj, = x'-j, = 0. Thus 



est(x") -est(x') = In^ 



exp 



z2j'<jPij'^ij' +Pij^i'j 



zr 



«j 



•"E 



exp 



z2j'<jPij'Xij' 

zr 



xL 



X 







xL 



+ In E I ^exp -^ + exp ^ + {n - j - 1) \ - In ^ E ^^P ~r~ "^ ''^ ~ -^^ 

Since -T > 1 by assumption, xfjr < l/{2mn). Further, since pij < Z for i £ Fj{Z), pijX^MZF) 
< x^J F < l/{2mn). Substituting in the previous expression for est(x") — est(x') yields 



est(x") - est(x') < In exp — - ^ 



exp 



l^j'KjPij'Xij/ 



ZF 



1-E 



exp 



/^j'<jPij'^ij' 

zr 



+ ln' 



Ei E,'<,- exp T^ + exp 2^ + {n -j-l] 



E» ( Ej'<j exp ^ + (n - j) 
I 



2mn 



+ ln 



1 + 



Ei(exp2;i--1 



^ E» ( Ei'<j exp -^ + {n-j] 



j 



< 



1 I ^^^ I Ei(exP2;^-l) ^ ^ 1 , exp 2^-1 



< 



+ 



2mn ' \ ' ^,- n I 2mn 

where the last inequality is because 1 + a < exp(a) for all a € M. Using 171,71 > 2, 



n 



Zfestfx") -est(x')) < -;- + 

4n 



Z Zexp(i)-1 



< 



n 



Z 

2n 



(21) 



15 



For the second bound, since x'l^. = x'^j, for k / j, Wi{'x") — 'Wi{'x.') < x'lJ F = x^jF. Further since 



ij/ 



Pij < Z for i € Fj{Z), Vi{x") - di{x') = ^|^ < xO./r. Hence 






Y, Q(i).(x") + zD,(x") - ^,(x') - u).(x')) + Y. ^^{x%, - x\y) < Y ^(2P 



3.0 
< 2Z^^ 

where the second inequaUty is because Cj + aij < Z for i € Fj{Z), and the last inequahty is by definition 
of x^j and since F > I and m > 2. From (|2TI ) and (l22l ). the lemma follows. D 

Corollary 20. IfF > 1, then at the beginning of every phase, cost(x) < 6Z \n{emn). 

Proof. When the previous phase ended, since ASSIGN(I^) did not fail, cost(x) < 5Zln(emn). Between 
phases, x is incremented by at most the arrival of a client, which increases cost(x) by at most Z/n by 
Lemma [m D 

We now show that the increase in cost(x) in any phase of the algorithm is bounded from above by the 
change in the dual objective. 

Lemma 21. If F > 1, the increase in ^ aj is an upper bound on the increase in cost(x) in every phase. 

Proof. We show the lemma for phase /, corresponding to request r. Let cost' and cost'"'"^ be the values of 
cost(x) before and after the variables are incremented in phase I respectively. We will show that cost'"'"^ — 
cost' < e gj (x'). Since e ej (x') is the increase in the dual objective, this will prove the lemma. 

Our proof follows the proof for Lemma |5] Let x' and x'+^ be the values of x before and after being 
incremented in phase /. In phase I, only the variables correpsonding to chent j get incremented. We ignore 
variables for the other requests, and for each Xij, let gi{t) := x\, + {x^^ — x\-)t be defined for < t < 1. 

Note that 5j(0) = x\, and gi{l) = x,^^. Define g(i) = {gi{t))i^F(Z)- With some abuse of notation, any 
function of x, say h{x), can be written as a function of t, with h{t) := /i(g(t)). Thus, the functions cost(x), 
ratejj(x), yj(x) and A(x) can be written as functions of t. In particular, ratejj(t) := d cost {t)/dgi{t). 
We use these alternate expressions in the remainder of the proof. By the chain rule, 

ticost(t) _ Y^ gcost(t) dgi{t) _ ^-^ dgi(f)_ 

dt -2^ dgi{t) dt ~ /^^■^^^^^■^'^ dt ' 

i i 

and hence, 

cost'+i - cost' = C ^^^dt = y [' rate,,(t)^dt. (23) 

Jt=o dt i 't=o dt 

The function ratejj(i) may not be continuous if for some i, i~'t()i(x'+^) = x^^^, but rwj(x') > x' ■. 
By our choice of updates in ASSIGN(/^) , the discontinuity is only at the point t = I; hence we redefine 
rateij(l) := limj_^i- ratejj(i). Since we change ratejj(i) at a single point, (1231 ) is still true. By Corol- 
lary |20l cost' < 6Zln{emn). Also, each variable gets incremented by at most a factor of ^u in a phase. 
Then by Lemma l46l rateij(t) < erateij(O) for < t < 1, hence 
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cost'+^ - cost' < eVratejj(x') / -^^dt = e Vrateij(x')(, 
~^ Jt=o "t ^ 

Since in phase / each variable is multiphed by 1 H f^-f- 






rateij(x' ' 



cost'+^ - cost' < ey^ratej,(x')— ^ —^ = ey^e,(x')x-,- < ee,(x') 

where the last inequality follows since, on entering the for loop, ^^ x\- < 1. Since eej(x') is the increase 
in the dual objective, this proves the lemma. D 



We now discuss our dual variables and show feasibility. By definition, 

(9est(x) 1 



dxij r 



Pij exp{Y^j,piyXij'/{Zr)) ex.p{xij/r) 



z T.i'^'y-viT.j'Pi'j'Xi'j'/izr)) Y^i>j>exp{xi>j>/r) 



(24) 



We use the following notation. Recall that for a client j, Fj{Z) is the set of facilities i with Cj + pij + 
dij < Z, and Xij = for any i Fj{Z). For all j and i Fj{Z), we leave the corresponding dual 
variables undefined. For facility i, rn is the index of the first client j so that i S Fj{Z). 

Vi, Zi{ni-1) := x°,. , andVj> rij, Zi{j) := max Xij> , (25) 

exp{x[-/r) 
Xij := max— j— — — - \/i G [m], ^j : i e Fj{Z) , (26) 

?7i := rnax= -= /r^rw Vi G m . 

ieL 2^., exp{}^-,pi'j>Xifj'/{Zr)) 

From (l24l) and these definitions, for any i, j and any phase I ^ Lj, 

(9est(x') 1 /py \ 



Define cj := 4e^ ln{2^mnp). Set the dual variables: 



^ + c.ln^i^ 



A, = Zxij ir^ + Ciln ;}'\^ , Vi G [yn] , Vj : f G F,- (Z) , 



ZJ 2e2 ' 



7. = ( ^^ + §) i , Vi G [m] , (28) 

Vi G [m] . 



6i = zi Yl Xij+m]^, 

\j:ieFjiZ) J 



We define Zi{j) for j G {nj — 1, ..., n} since the definition of /?/„. requires Zj(nj — 1). We now show 
bounds on the infeasibility of each dual constraint in D2(Z, F) in the following sequence of lemmas. 

Lemma 22. For all i, j, aj < jr [fiij + Pij^i + aij-^). 
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Proof. For any i, j, let Lf- be the set of phases where Wi > Xij/F before Xij is incremented, and L~ is 
the set of phases where Wi = Xij/F before Xij is incremented. Then Lf- U L~: = Lj. Let / be the first 
phase executed when Xij/F = Wi at the beginning of the phase. Then in every subsequent phase in Lj, 
Xij/r = Wi at the beginning of the phase. Hence any phase / G L^ occurs after all the phases in Lf-. 

In any phase in Lf- except the last, Xij is incremented by ( 1 + ej (x' ) /ratejj (x' ) ) . Before the last phase, 
Xij < 1. In the last phase, Xij is incremented by at most (1 + ej(x')/ratejj(x')) < /x. Hence 






Using 1 + o > e"/"^ for < a < 1, and by rean^anging the terms. 



exp 



E 



ej(x') 1 / ^ 



< 



U6L>.^^^*''^J' (''')/ 4 



Taking the natural log on both sides, and observing that x?- > l/{2mnp), 



E^^tsIf)) - '"4 - """""""'' 



and hence 



/ fj(x ) ^ eln(2/i?7in/3) maxratejj(x ) . (29) 

If L~j 7^ 0, then during the execution of phases in L~j, Xij increases from an initial value of Zi{j — 1) 
to Zi{j) after the completion of the phases in L^. If j is the first client in Ci{Z), then its initial value is x?-, 
hence Zi{j — 1) = x^- as defined in (|25] ). By a similar analysis as for Lf-, 

V e' < e(ln ,' , ] maxrate^,(x') . (30) 



ieL= 



The dual variable Oj gets incremented by eej(x') in each phase for client j. Hence, a^ = e ( ^;gj^> ej(x 



J\ 



+ E/eL= ei(xO )■ Thus, from ^ and (l30]l, 

In(2n7n7ip) max rate,, (x ) + In — j^ niaxrate,'i(x' 

Replacing the values of rateij(x') for / G Lf- and / G L~j from dTSl) . 



2 ( r, (9est(x') QPij flij \ 
a,- < e Zmax — h ^77^ + ^ \n(2pmnp) 

\ i&L> dxij zr r j 

( aest(x') CiPij Q ajA Zi{j) 

+ z max — 1 -\ 1 m ■ 



K^Lf. dxij zr r r Zi{j - 1) 
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For any client j, xij < //, and hence Zi{j) < /z. Since Xij > x?-, Zj(j) > x°-. Thus, Zi{j)/zi{j — 1) < 
fJ-fx^j < 2^mnp. Thus 

and from the expression for dest{-x.^)/dxij in (l24l ). 

a. < 2e2 (I (^r?, + x.,) + c.§; + ^) ln(2^mnp) + e^^ In -^ 



e 

r 



/ C'\ \(T Z' ( i] 

p.. (^. + i j + ^x., + a., j ^ + Q m ^^^ 



(31) 



By definition, /3jj = ZxjjO"/(2e^) + Ci In ^5' _;^-) , and 7^ = (r/j + ^) a/{2e'^). Replacing these expressions 
in (I3TI ) yields Oj < ^ (/3jj + Pjj7j + aija/{2e^)) , proving the lemma. D 



Lemma 23. For all i, 2_\ Pij + ^7i — i^i < '^^■ 

j:i&F,{Z) 

Proof. By definition (l28l) . for any i. 



2^*0') cr 2:j(n) 



^ ftj+Z7, -5i = Q^ + Ci ^ ln-^^ = Ci^+Qln 



where n is the total number of clients, and rij is the index of the first client j such that i G Fj{Z). By 
definition (|25] ). z(nj — 1) > l/{2mnp), and Zi(n) < Xjj < ;U by Fact[T8l Hence 

y^ Aj + Z'ji - 5i < 3ci ln{2fimnp) < Ci— . 

j:i&F,{Z) 



n 



Lemma 24. V](5j < Z(H- In(mn) + maxA(x'))— . 

i 

Proof. We show that 

^ Xij < 1 + ln(mn) +maxA(x^) (32) 

and 

^ 7?i < 1 + ln(m) + max A(x') (33) 



maximizes — , — — . Define hij := e '^ ' . Since variables are nondecreasing. 



which suffices to prove the lemma. For (|32] |. let j G [n] and i G Fj{Z), and define 4>{i,j) as the phase that 

max 1 — — < > ■= — 
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and by Lemma l44l this is at most 1 + ln(^ ■ i^p.rz) ^ij) ^ 1 + In(mn) + max; A(x ). 



Similarly, for (l33T l. define (p(i) as the phase which maximizes — '^ i^ '^^ — ttztT' ^^^^ define bi :- 



exp(E,K.<^'V(^/^) 



X,p..^W(^^) 



gA^j /-ij Sj 



max — ; — - — - < > 



and by Lemma l44l this is at most 1 + In ?n, + max; A(x'). D 

Let zv := (1 + ln(?n,n) + max; A(x')). Then 

Lemma 25. For x obtained by AssiGNfrj , the vectors a' = ajiya), /?' = e^l^/^ua), 7' = 6^7/(7^0-) 
and 5' = e^5/{v(T) are feasible for D2( Z , F). 

Proof. We show that the constraints in D2(Z, F) are satisfied by (a', /3', 7', (5'). For the third constraint in 
D2(Z, T), from LemmaEl J^i ^i = Ei ^te'^/(.<^^) < Z 
For the second constraint, 



where the first inequality is because v > I, and the last inequality is from Lemma |23] 
For the first constraint, 

where the last inequality follows from Lemma |22] and since u > 1/2. Hence, (a', /3', 7', 5') are feasible for 
D2(Z, T). D 

We now use the primal-dual framework to show that given Z, F such that 4(T ^ > F > 

2(7 ^^ , our algorithm will succeed, and bound the competitive ratio obtained for LPl(Z). 

Lemma 26. IfCCFL-LPl(Z) is feasible and ^ > '^'P'^^^^\ then AssiGNfrj does not fail. 

Proof. Let (x^,y'^, V) be the current values for LP2(Z, F) and {a^ ^j3^ ,'^f ,5^) be the current values 
for D2(Z, F) when the condition in fine [10] is being checked. Let (x*, y*, A*) and (a*, /3*, 7*, 5*) be the 
optimal primal and dual solutions for LP2(Z, F) and D2{Z, F). Then 

ovT2{z, F) = zy + Y, c^y: + Yl «u-4/^ = E "i (34) 

where the second equaUty is because of LP strong duality. From Lemma |25l a^ /{ua) is feasible for the 
dual. Hence Ei ck^ ^ E7 '^i /i'^^)- Then from (l34l ). and from Fact [TT] 



^j J — ^^j J ' 



OPri(^) = FOVT2{Z,F) > rj^aj/(i/a). 
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Since ^ aj is an upper bound on the change in cost(x) in each phase, ^ a- > cost(x-'^) — cost(O) — 
Y^ initj . Thus 

uaOVTiiZ) > r I cost(x-^) - cost(O) - J^initj ] . 

OVTi{Z)/Z > 1 by Fact [16] Hence F > I by the condition in the lemma statement, and thus from 
Lemma [T9l and since cost(O) < 2Zln{mn), 



uaOVTi{Z) > r(cost{xf)-2Z -2Zln{mn) 
By definition, v = I + ln{mn) + A(x-^), and A(x-^) < cost(x''^)/Z. With these substitutions, 

aOVTiiZ) nn(emn) + ^°^^^ M > F ('cost(x-^) - 2Z - 2Zln{mn)) , (35) 

and from the bound on F in the lemma statement, 

-Z [in(emn) + ^"'^ 1^ ' j + 2Zln(mn) + 2Z > cost(x^) . 

Simplifying yields |Zln(em?i) > icost(x''^). Hence, cost(x''^) < 5Zlii{emn) and the algorithm will not 
fail. D 

Lemma 27. IfCCFL-LPl(Z) is feasible and ^%'^^^ <^< 2^^5^^' AssiGNfrj returns a solution 
to CCFL-LPl(Z) of cost 0(ln(mn) ln{mnp))OVTi{Z). 

Proof Let (x-^, y-^, A-^) be the solution for LP2(Z, F) our algorithm returns. By Lemma l26l AssiGN(r) 
does not fail, and hence cost(x-'^) < 5Zln(emn). Substituting this bound on cost(x'^) in the expression on 
the left in dSS]) yields 

aOVTiiZ) 6ln{emn) > F ('cost(x-^) -2Z - 2Zln{mn] 
or 6ln(emn)aOVTi{Z) + 2ZFln{emn) > /^cost(x-^). Substituting the upper bound on i^, 

6aln{emn)0'PTi{Z) + 8(Tln{emn)OVTi{Z) > rcost(x^) . 
Since i~'cost(x'^) is an upper bound on the cost of the solution obtained for CCFL-LPl(Z), the proof 

follows. n 

The following theorem now follows immediately from Lemmas |26] and l27l 

Theorem 28. IfCCFL-LPl(Z) is feasible and Z, F satisfy 2^^^^ ^ ^ ^ ovr^^{Z) ^ ^^^^ AssiGNfTj 
does not fail and returns a solution to CCFL-LPl(Z) of cost 0(ln(mn) ln(mnp))0'PTi(^). 
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3.2 A Doubling Procedure for F. 

If we are not given F that satisfies the conditions of Theorem |28j we use a doubling procedure similar to 
that described in Section IZ21 Initially set F = 1 and iiin AssiGN(r) . Each execution of AssiGN(r) is 
called a trial, and each trial r has a distinct set of primal and dual variables (x(r), v(r), w(t), A(r)). In 
each trial, Xij{T) is initialized to x^ for each client that arrives during that trial, and the other variables are 
updated accordingly. If a trial fails, we double F and proceed with a new trial with a new set of primal and 
dual variables. We continue in this manner, doubling the value of F after each failure, until all clients are 
assigned. The cost of the solution we obtain is then at most the sum of the costs obtained in each trial. 

Let A{Z, F) be the cost of the solution to LP2(Z, F) obtained in each trial. Hence, by Fact [T7J 
FA{Z, F) is the cost of the solution to CCFL-LPl(Z) in each trial. Define A{Z) as the sum of the (partial) 
solutions to CCFL-LPl(Z) in each trial. Thus, A{Z) := Y,r rA{Z, F). 

Theorem 29. If CCFL-LPl(Z) is feasible, AssiGNf/^j with the doubling procedure for F obtains a frac- 
tional solution to CCFL-LPl(Z) of cumulative cost 0{\n{mn) \n{mnp))0'PTi{Z) over all trials. 

Proof By assumption, CCFL-LPl(Z) is feasible. Initially, T = 1 < OVTi{Z)/Z since A > 1. Since we 
double F each time AssiGN(r) fails, and by Lemma[26]AssiGN(r) will not fail for F > 2aOVTi{Z)/Z, 
F < AaOVTi{Z)/Z in any trial. Further, in any successful trial, A{Z,F) < 8Zln(emn). For any 
failed trial r, at the beginning of the phase when the trial failed, A{Z,F) < cost(x) < 6Zln(3mn) by 
Corollary |20] In one phase, each variables Xij is incremented by at most a factor of /i = 1 + 1/(6 ln{emn)). 
Thus in any failed trial, A{Z, F) < p6Zln{emn) < Z{\ + 6In(emn)). 

Thus over all trials, the cumulative cost of the solution to CCFL-LPl(Z) A{Z) is at most Z{1 + 
6 ln(e?TT,n)) '^p F. Let F^ be the value of F in the final trial. Since F is doubled after each trial, A{Z) < 
Z{1 + 6ln{emn))2FF Since F-f < A(tOVTi{Z)/Z, and a = Ae^ ln{2p,mnp), the theorem follows. D 

3.3 Obtaining an Integral Solution 

We will now build upon the fractional algorithm for CCFL and give a randomized rounding procedure 
that obtains an integral assignment of clients to facilities. As before, we will assume that the fractional 
assignment Xij of client j to any facility i not in Fj{Z) is always zero. We first give a rounding procedure 
that uses Theorem|29]and obtains an integral assigment of clients to facilities. We will then show how to use 
this integral assignment procedure for a fixed parameter Z to obtain an 0(ln^(m?i) In (mn/))) -competitive 
solution to Z*. 

We run our rounding procedure whenever a new client j arrives, and given a fractional solution (x, y , A) 
for CCFL-LPl(Z) that satisfies ^^ Xij > 1. The procedure returns a set of open facilities and an integral 
assignment of j to an open facility. 

We assume Xij < 1 without loss of generality. For each client j, let S{j) := {i : Xij > l/{2m)}. Cj 
is a set of candidate facilities for the integral assignment for cUent j. Initially, Cj = 0. O is the set of 
facilities opened so far, and O = initially. 

Our randomized rounding procedure is as follows. For each facility i, select r = \Ae In n] random 
variables uniformly at random between and 1; let tn, ti2, . . . , U^r be these random variables for facility 
i, and tj := min^ tj^. When client j arrives. 
Step 1: For each facility i ^ O, add i to O if yi > ii. 

Step 2: For each facility i £ Sj, add i to Cj independently with probability Xij/y-i. If i G Cj, then i is a 
candidate for j. 

Step 3: For each facility i G O, assign client j to i if i is a candidate for j. Denote an assignment of client 
j to facility i by j ^ i. If j is assigned to multiple facilities, pick one arbitrarily. 
Step 4: If J is not yet assigned to any facility, assign it to facility i e Sj that minimizes q + aij + pij. 

Steps 3 and 4 thus give an integral assignment of clients to facilities. We show: 
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Theorem 30. Tlie integral assignment obtained has total cost 0(ln(mn) In(mnA) ln{mnp))0'PT{Z). 

We first show that with high probability, no client is assigned in Step 4: 
Lemma 31. For any client, the probability that it is assigned in Step 3 is at least 1 — 1/n?. 
Proof. For a client j that has just arrived, consider a facility i ^ Sj. In Step 3, 

Pr[j — > i] = Pr[i is open] • Pr[i G Cj] 



(1- 


-Pr[y,<t,])^ 


(1- 


- (1 - mY) ^ 

Vi 


(1- 


g-yir\ ^ij 



> rxij/e 

where the first inequality is because (1 + x) < e^ for all x € M, and the second inequality is because 
e~^ < 1 — (x/e) for < x < 1. Thus, the probability that j is not assigned to a fixed i G Sj is at most 
1 — (rxij/e) < e~''^»j/'^. Since these probabilities are independent, 

Fr[j is not assigned] = TT Pr[j is not assigned to i] 

For any i Sj, Xij < l/(2m), hence Yli^s- ^v — ^/'^- Thus J2ies- ^v — -'^Z^' ^^'^ since r > 4elnn, 
the probability that client j is not assigned to any facility in Step 3 is at most l/(n^). D 

Since each client is assigned in Step 3 with high probability, the effect of Step 4 on the total cost of 
the integral assignment is negligible. The following lemma follows immediately from Lemma[3T]and since 

SjCF,{zy. 

Lemma 32. Tlw assignments in Step 4 increase the total cost of the integral assignment by at most Z/n. 

We now show bounds on the total cost for assignments in the remaining steps. We first bound the 
expected fixed-charges and assignment costs. 

Lemma 33. The expected sum of fixed-charges X^jg^ Q is at most r ^^ Qyj. 

Proof. The probability that facility i is in O is Pr[yj > tj] < X^^=i Pr [y, > tjfc] = fVi by the union bound. 
The lemma follows. D 

Lemma 34. The expected assignment costs for clients assigned in Step 3 is at most r ^- aijXij. 

Proof. For facility i, the probability that z is in O is at most ryi by the proof of Lemma l33l For any client 
j assigned in Step 3, Pr[j — )• i] = Fr[i is open] • Pr[z G Cj] < rxij. The bound on the expected assignment 
cost follows. D 
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We now prove the bound on the expected maximum congestion for the integral assignment. Define 
the candidate congestion for a facility i as Li := Ylj-ieC Pij- ^'^^ ^^y realization of the random bits, 
the candidate congestion of any facility is an upper bound on the actual congestion for clients assigned 
to the facility in Step 3. We will prove an upper bound on the expected value of the maximum candidate 
congestion over all facilities, which will give us a bound on the expected value of the maximum congestion. 

We consider the maximum candidate congestion instead of the actual maximum congestion because for 
a fixed facility i, the actual assignments of the clients are not independent of each other. If client j — 1 is 
assigned to facility i, then the facility must be open, and hence the probability that client j is assigned to i 
increases. However, for any facility i and clients j / j', Fr[i € Cj] and Pr[i € Cji] are independent. 

For the next lemma, for any client j that arrived in the current trial, define yi{j) as the value of yi when 
the randomized rounding procedure was executed for client j. 

Lemma 35. The candidate congestion Lf on any facility i at most Z ln(2emA) in expectation. 
Proof. Consider a fixed open facility i. 



(c)i _ V^ PijXjj 






For any client j. 



1 ^-^ 



^ ^7r(:i)2^Pir^^j'■ (36) 

By Lemma|47]with P = E,:iGS ^^ ^^^ ^ ^^ defined here, 
where k is the first client j such that i G Sj. Then yi{k) > Xj^ > l/{2m), and yi{n) < X. The lemma 

follows. n 

To bound the maximum candidate congestion, we use the following inequality: 



Lemma 36 ( II23II ). Let Xi, . . . , Xn be independent random variables with Pr(Xj = 1) = qj, Pr(Xj = 0) 
= 1 — Qj. For X = X]?=i (^j-^j' define v = X]"=^ a?(7j and a = maxj Oj. Then 



Pr(X > E{X) +fi)<e 



^l 

2^+'^ 



Lemma 37. The maximum candidate congestion is at most AZ ln(2emA) in expectation. 

Proof. Fix facility i. For each client j, let aj = Pij/Z if i € Sj, and aj = otherwise. Hence aj < 1. 

Define random variable Xj = 1 if i £ Cj, and X := ^ o.jXj = ~z~^ since Cj C Sj. Let qj := 
Pr[i € Cj]. FromLemmalM] E{X) < ln(2emA). 

We will use Lemma [36] to show that with high probability, the candidate congestion L^ on facility i 
does not exceed thrice the expected value. Let fi := 3 ln(2e77iA). Then from Lemma [36l 



Pr(X > 31n(2emA)) < e ""+' 
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Since Uj < 1 for all j, a < 1. Also, u = J2j O'JQj ^ J2j '^jQj = ^{^) — /^/3- Then 



Pi-(X>31n(2emA)) < e ^+^ 

< e 4 

< (2emA)-2 < _1_ (37^ 

where the last inequality is because A > 1 by CCFL-LPl(Z). Thus, the probability that the candidate 
congestion of a fixed facility exceeds 3Zln(2e77iA) is at most l/{Am?), and by the union bound, the 
probability that the candidate congestion of any facility exceeds 3Z ln(2e7nA) is at most l/(4?7i). To get the 
bound on the expectation of the maximum candidate congestion, we observe that the candidate congestion 
of any facility can at most be J2ji£S P'^j — '^^^'^'l2i Pij^ij /ViU)' since Xij > l/{2m) for any client j 
with i e Sj, and yi{j) < A. From the constraints in CCFL-LPl(Z), YljPij^ij/yiU) — -^' ^^'^ hence the 
candidate congestion is bounded by 2mAZ. Thus the expected value of the maximum candidate congestion 
is almost 3Zln(2emA)(l - 1 / {Ae^ tti^ X'^)) + 2mXZ/{Ae^m^X'^) < AZln{2emX) using A > 1. D 

We now use the bounds on the congestion, fixed-chaiges and assignment costs to prove the bound on 
the expected total cost from Theorem l30l 

Proof of Theorem UU\ By Theorem |28] and Lemmas |33] and l34l the sum of the fixed-charges and assignment 
costs for the integral assignments is 0(lnnln(?7in) ln(?7inp)) OVT{Z) in expectation. By Lemma [37] 
the maximum congestion is 0{hi{mX))Z < 0{lnm)OVTi{Z) in expectation, since OVTi{Z) > ZX. 
Assignments in Step 4 add at most Z/n to the total cost by Lemma l32l Summing up, the total cost of the 
integral assignment is 0{\ii{mnX) \n{rnn) \n{mnp)) OVTi{Z) in expectation. D 

3.4 A Doubling Procedure for Z 

We now use the rounding procedure with a doubling argument and describe an algorithm for the CCFL 
problem. We assume we are given a procedure that, given Z, maintains an integral assigment of clients to 
facilities of total cost 0(ln'^{mn) ln{mnp)) OVTi{Z). Let Q(.Z') denote this procedure, and let cq(Z) be 
the expected total cost obtained by this procedure. We start with Z = minj{ci +Pij + ajj}, and run Q{Z). 
Call each execution of Q_{Z) an epoch. In each epoch, Q(Z) uses a distinct set of variables. If cq(Z) 
> 0(ln^(?TT,n) ln{mnp))Z in an epoch, or if for some client j Fj{Z) = 0, we fail Q{Z), double the value 
of Z, and run Q{Z) for the clients not yet assigned with the new value of Z and a new set of variables. 

We start by showing that if Z is at least Z*, then Z and OVT{Z) are close, and bounding the total cost 
returned by the procedure 



Lemma 38. If Z > Z\ then OVT{Z) < 2Z and cq{Z) < 0{ln^imn)ln{mnp))Z. 

Proof. li Z > Z*, then CCFL-LPl(Z) is feasible, since for every client j, Fj{Z) / 0. Consider the 
solution to CCFL-LPl(Z) that sets yi = 1 for every facility that is open in the optimal solution, and 
Xij = 1 if client j is assigned to facility i in the optimal solution. Set A = 1. Since Z* is an upper bound 
on the congestion of any facility in the optimal solution and Z > Z*, this gives a feasible solution to 
CCFL-LPl(Z). Then the sum of the assignment costs and fixed charges for this solution are at most Z*. 
Also, ZX = Z. Hence, the total cost of this solution is at most Z + Z* < 2Z, and hence OVTi (Z) < 2Z. 
The bound on cq{Z) follows from Theorem l30l D 

Finally, using Q{Z) with the doubling argument described, 
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Theorem 39. The sum over all epochs of the expected total cost in each epoch is 0(ln {inn) hi{innp))Z*. 

Proof. Let Zf be the value of Z in the final epoch. By Lemma [38] and our failure condition for an epoch, if 
Zf > Z* then the epoch must succeed. Since Z is doubled after every failed epoch, Z^ < 2Z*. For every 
epoch, cq{Z) < 0(ln^(?7in) ln{mnp))Z, since otherwise we would have failed the current epoch. Since 
we double Z on every failed epoch, the sum over all epochs of the total cost in each epoch is J2z "^qI-^) 
< 0(ln^(?TT,n) ln{ninp)) J2z ^' where the sum is over the values of Z in each epoch. Since Z is doubled 
after each epoch, J2z ^ — '^^'^ — ^^*- Hence, the total cost over all epochs of the integral assignment 
obtained by our algorithm is 0{\v?{nin) ln{mnp))Z*. D 

3.5 A Lower Bound for UMSC and CCFL 

In this section we give a lower bound on the competitive ratio for bicriteria results for UMSC. These lower 
bounds on bicriteria results motivate the problem of minimizing the sum of makespan and startup costs for 
machine scheduling that we study in Section [3] 

CCFL generalizes UMSC, and our lower bound extends to bicriteria results for CCFL as well. Let 
T* be the makespan of an assignment of jobs to machines, m and n be the number of machines and jobs 
respectively, and p as defined in Section [S] Let C* be the optimal startup cost of an assignment with 
makespan T*. We show 

Theorem 40. No deterministic online algorithm can obtain a solution with makespan o{m)T* and startup 
cost within a factor polylogarithmic in m, n, and p ofC*, even ifT* is available offline. 

Our lower bound is in fact for any fractional solution that allows jobs to be assigned fractionally to 
machines, and machines to be fractionally opened. For each job, the sum of the fractional assigments to 
machines must be at least 1 , and the fraction by which any machine is opened must be at least the fractional 
assignment of any job to the machine. 

Our example has m machines, and 2(m — 1) jobs. We choose T* = m. The sequence of job arrivals 
is fixed, and known to the online algorithm; the only freedom we allow is that we could stop sending jobs 
from the sequence at any time, and send trivial jobs with pij = for all i instead. We will show that no 
deterministic online algorithm can obtain a startup cost that exceeds the optimal startup cost by factor at 
most polynomial in m, and a makespan at most T*m/2, even in a fractional solution. 

The cost of machine i in our example is e"*^*"^-'. Thus machine 1 has cost 1, and machine m has cost 

2 

gm -m jj^g index of each job corresponds to its arrival in the sequence. Thus job j, if it arrives, is the jth 
job to arrive. The jobs are of two types, even jobs and odd jobs, corresponding to their index. An odd job j 
can be assigned to either of two machines: machine 1 with processing time T*, or machine (j + 3)/2 with 
processing time e. An even job j can only be assigned to machine (j + 2)/2. 
We start with an observation about the optimal assignment. 

Lemma 41. For jobs 1, . . . ,k with k even, there is an assignment of these jobs on machines 2, . . . , (A:+2)/2 
with makespan T*. 

Proof. Assign each odd job j to machine (j + 3)/2, and each even job j to machine (j + 2)/2. Then 
no job is assigned to machine 1, and no job gets assigned to machine i for i > {k + 2)/2. Also, each 
machine 2 < i < (fc + 2)/2 gets assigned at most one odd job with processing time e, and one even job 
with processing time T* — e. The lemma follows. D 

Suppose now that an odd job j arrives. Then 
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Machines 


1 


2 


3 


4 


5 


Jobl 


T* 


e 








Job 2 




T* -e 








Job 3 


T* 




e 






Job 4 






T* -e 






Job 5 


T* 






e 




Job 6 








T* -e 




Job 7 


T* 








e 


Jobs 










T* -e 



Table 1: The example depicted for 5 machines. All missing entries are oo. 

Lemma 42. Any fractional algorithm that obtains startup cost o{e™ /m) of the optimal startup cost must 
assign job j to machine 1 with fractional value > 1/2. 

Proof. Let k := (j + 3)/2. By Lemma |4T1 jobs 1, . . . , j — 1 can be assigned to machines 2, . . . , k — I 
with makespan T*. Job j can be assigned to either machine 1 with processing time T*, or machine k with 
processing time e. Thus in an optimal assignment of jobs 1, . . . ,j, these jobs can be assigned to machines 
1, . . . , A: — 1 with makespan T*. The online algorithm cannot assign j to machine k with fractional value 
> 1/2, since the startup cost would exceed 6^*^/2, while the startup cost for the optimal assignment is at 
most me^^~^^"^. Thus, the algorithm must assign j to machine 1 with fractional value > 1/2. D 

From Lemma|42j every odd job must be assigned to machine 1 with fractional value > 1/2, and hence 
the makespan obtained must be at least m/2, since in our example we send 2(m — 1) jobs. Thus, any 
deterministic algorithm must have makespan at least m/2, or startup cost il(e"Ym). 
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A Appendix 

We present technical lemmas which are used in various proofs. 

Lemma 43. Let y := X]^=i tj, where < r^ < Ifor i £ [n], and n"=i ^i = ^- Then y is minimized when 

Tj = pi/" \/i, and the minimum value is nP^'^. 

Proof. The proof is by induction on n. For n = 1, the lemma is obviously tme. Let ^k{P) be the minimum 
value of the sum of k variables, when the product of the variables is P. Then ^n{P) = mino<ri<i{?'i + 
7„_i(P/ri)}. By the inductive hypothesis, 7„„i(P/ri) = (n — l)(P/ri)^' '•"'~^). Hence 

r /p\i/("-i)' 

7„(P) = min lri + {n-l)[ — 
0<ri<l I V^i, 

We will show that the expression on the right is minimized when ri = P^/". Then by the inductive 
hypothesis, each of the other variables is P^/" as well, completing the proof. 
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/ Nl/(n-l) 

Let z = ri + (n - 1) -^ . Then 



dz 

dri n 

and setting dz/dri = 0, we obtain ri = P^/". Further, d'^z/drf > Vri > 0. Hence, the point ri = P^/" 
is a minimum. This completes the proof. D 



1 - ^r-"/('^-^)(n - l)pi/{n-i) , 



Lemma 44. for an)' ?7, € Z_|_ anii ai, 02, . . . , an € M>o w/?/j ai > 0, 

En 



«e n 



Ej<iaj ai 



Proof. For n = 1, the statement is trivially true. For n > 2, define 6j = ^,<j aj. Then ai = hi — 6j_i for 
i > 2, and hence 



Oi -, . ^h- bi-i , . sr^ f , h-i 



^i<i^l i=2 "' i=2 



ie[n 

Let n = 5^, and let y = ^^2(1 " ^)' then y = Er=i'(l " n) = (n - 1) - EH'i' H- Since each 
Tj < 1 and nr=^i '^j = bi/bn, by Lemma l43l 

/ft Xl/(n-l) 

y < (n-l)-(n-l)f ^j . (39) 

Let c = ^ and 2; = (n — 1) — (n — l)c^'^"~^^ Differentiating z w.r.t. n, 

|i = i_cV(-i)+^^__lnc 
on n — 1 

and again d'^z/dv? = —c^i^'^~^>\v?c/{n — Vf' < 0. Hence, z is maximized when (n — 1) — (n — 
]^^gi/(n-i) _ gi/(n-i) jj^ 1 Substituting the expression on the left in this equality in ( [39l ) gives us y < 

gl/(n-l) ^j^ 1 ^j^j ^jj^^g c = ^ < 1, 

C O7J, 

y<lni = ln^ = lnSi^. (40) 

c 61 Ol 

Then from (|38] ) and (l40l ). and by definition of y. 

En 

n 

Lemma |45] is used in the proof of Lemma|5]in Section|2l As in Section |2j /i := 1 + tjj^t^;^- 
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Lemma 45. Given x' and x" with A(x') < 3ln{em) and x'- < x'' < fix', where // = 1 + g^^^^y 
rj(x") <erj(x'). 

Proof. By definition of r(x) in ([3]), and since x' < x" < ^x' , 

rj(x ) = -^ < -^ = . (41) 

EfcGM exp(Px")fc Hkeim] exp(Px')fc 

Since A(x') < 31n(e?n), V/c, (Px')fc < 31n(em), and hence /u(Px')fe = (Px')fc + (Px')fc/(31n(e?n)) 
< (Px')fc + 1. Substituting (Px')fc + 1 for ^(Px')fc in (gB yields 

, „. ^ EfceHPfcjexp((Px')fc + l) EfcGMPfcJ exp(Px')fe 

Tj (x ) < i— i ^ = e i— ' = = evj (x ) , 

EfceM exp(Px')fc Efce[m] exp(Px')fc 

proving the lemma. D 

Lemma l46l is used in the proof of Lemma |2T] in Section |3] Define /x := 1 + g^^/^^y We assume 
that x', x" satisfy x^^ < x'/^ < ^x[j for all i, j; that cost(x') < QZ\a{emn), and that rwi{-K') = x'^j iff 
rwi{-x.") = x'lj for all i, j. 

Lemma 46. For alii, j, ratejj(x") < eratejj(x'). 

Proof. Given x', x" as defined in the lemma, and a fixed facility i, we will show that 



T.i> exp(Ej' li'j'x'l,.,/{Zr)) J2i' exp(Ej' li'j'x'-,j,/{Zr)) 

and 



x"./r x'.Jr 



< e -^^. (43) 






Since the other terms in rate,j (x) are constants, this will prove the lemma. 

Since ZA(x') < cost(x'), and cost(x') < 6Zln{emn) by the condition in the lemma statement, 
A(x') < 61n(emn). Hence J2j' hj'x[j,/{Zr) < A(x') < 61n(emn), and x[Jr < A(x') < 6hi(emn). 
Thus 

^f ^'J'^ij' < ^li^^fk = (l + —^—] ^J' ^'^''^^J' < ^i' ^^-^'^^J' + 1 (44) 

and 



zr - ^ zr V 6in(em7i)y zr - zr 



^ < M^ = 1 + 77^7 A^ < ^ + 1- (45) 

1 I \ bm[emn)J I I 

Then (0211 and (l43]l follow from (gill and (|45]l respectively. D 

For the next lemma, we are given p, x and u G M" , with the elements of u non-decreasing. For 
1 < k < n, define 

k 



111. ^ — ' ■' 
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and T := max/j T^. We also define 

n 

PjXj 



^-E 






Lemma 47. WifY/i P, T a«6f u defined as above, P < T ( 1 + In ^ 

Proof. We first obtain a different expression for P, and then relate the expression we obtain to T. 



I L IV IV /-I -I 



n— 1 ^ \ ^ fc n—l / \ 



k=l ^ '^'^^' "- j=l fc=l 

The expression on the right in (1461) is exactly the same as the expression on the right in (1381 ). with 

Uk + 1 ) ' 

n 



bi-i = Ufc. Then from (l40l ). since c = ui/un and y = X^^^^ ( 1 — 



P < T ( 1 + In 

Ui 
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